INTEGRAL POINTS ON ELLIPTIC CURVES AND THE 
BOMBIERI-PILA BOUNDS 



DAVE MENDES DA COSTA 



Abstract. Let C be an affine, plane, algebraic curve of degree d with integer 
coefficients. In 1989, Bombicri and Pila showed that if one takes a box with 
sides of length N then C can obtain no more than Od,c (Af 1 / d+e ) integer points 
within the box. Importantly, the implied constant makes no reference to the 
coefficients of the curve. Examples of certain rational curves show that this 
bound is tight but it has long been thought that when restricted to non-rational 
curves an improvement should be possible whilst maintaining the uniformity of 
the bound. In this paper we consider this problem restricted to elliptic curves 
and show that for a large family of these curves the Bombieri-Pila bounds can 
be improved. The techniques involved include repulsion of integer points, the 
theory of heights and the large sieve. As an application we prove a uniform 
bound for the number of rational points of bounded height on a general del 
Pezzo surface of degree 1. 



1. Background and Motivation 

Let C be a plane curve (projective or affine) defined over Q. The most basic 
number theoretic question one can ask about C is to characterise the cardinality of 
the sets C(Q) and C(Z) (the latter being of interest when the curve is affine). An 
aim of Diophantinc geometry is to classify the nature of these sets in terms of the 
geometry of C. This aim has seen spectacular success in the 20 th century with the 
genus of the curve g(C) emerging as a key player. 

In this paper, we shall be concerned with integer points on affine curves (in partic- 
ular the usual affine patch on an elliptic curve). The main theorem in this area is 
Siegel's Theorem which teaches us that if g(C) > 1 then #C(Z) is finite. (In fact 
it says that this is true with Z replaced by the ring of S'-integers of any number 
field.) When C is rational (i.e., g(C) = 0), examples exist where C(Z) is finite and 
others where it is infinite. This behaviour is well understood. 

After one has Siegel's Theorem, a natural question is to ask for a more effective 
bound. Such results in this area tend to depend on the curve in question, indeed 
they must since there is no uniform bound for the number of integral points on 
a non-rational affine curve. Since one must make reference to the curve in such 
bounds the next step is to decide which features of the curve we wish to permit 
ourselves to use. This is a matter of taste and different types of bounds have dif- 
ferent applications. 

In this paper we shall be following the tradition of trying to maintain as much 
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uniformity as possible. As we have noted, if we desire results which are very uni- 
form but yet still apply to a large number of curves then it is difficult to study the 
cardinality of C(Z). Thus attention shifts to questions concerning the density of 
integral points. In particular one may ask how the counting function 

C(Z,N) :=#{(x,y)eC(Z) : \x\ , \y\ < N} 

behaves as N > gets very large. Since we want our results to be uniform, an 
asymptotic result is not the correct thing to be looking for. Instead we would like 
a bound on this function which depends on N and on as little to do with the curve 
as possible. If we were to look to bound C(Z, N) independently of any information 
about C then the best we could do would be the trivial bound C(Z, N) <C N 2 since 
there are curves of high degree which subsume all the integer points in the box. 
Therefore we should ask for a bound in terms of the degree of the curve. 

The breakthrough result in this area was reported in 1989 when Bombicri and 
Pila [3] showed that if C is a plane curve of degree d then 

(1) C(Z,N) <Cd, e N x / d+t . 

This bound is tight as is seen by considering the curves 

C d :y = x d 

The interesting thing about this family is that it consists of rational curves only. 
Indeed, the only known examples where (1) is tight is when C is rational. From 
the point of view of Diophantine geometry this is intriguing since it once again 
points to differing arithmetic behaviours being catagorised by geometry. Indeed, 
this prompts us to ask the following question: 

Question 1. If C is a non-rational curve of degree d then is there a 5(d) > 
depending only upon d such that 

C{Z,N) <C d JvVd-*(«0? 

Currently, the best evidence that this question has a positive answer is that the 
corresponding question for rational points does. In 2000, Heath-Brown [8] showed 
that if one takes a plane projective curve of degree d and counts points of (naive) 
height less than N (that is, points P = [x : y : z] such that x 7 y,z are coprime 
integers of absolute value less than or equal to N) then for the associated counting 
function one gets 

C(iv,Q)« d , e N 2 / d+e . 

Once again the only examples where this is tight are the projective versions of 
the curves we mentioned before and so the same question as above can be asked 
about non-rational curves. This was answered in the positive in 2005 by Ellcnberg 
and Venkatesh [6]. So there is hope that our question has a positive answer and 
this answer is in reach even though the methods in [6] do not directly apply. 
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This paper is concerned with the case where C is a non-singular degree 3 non- 
rational curve, i.e., an elliptic curve. We shall show that for a large class of such 
curves we can give a positive answer to the Question. In particular we show the 
following two results. Firstly we have our 

Main Theorem. Let E be an elliptic curve given by the equation E : y 2 = f(x) 
where f £ Z[x] is a monic cubic with no repeated roots. Then there is a 8 > 0, 
independent of E, such that E(Z,N) <C N 1 / 3 ^ 6 (the implied constant is uniform). 



Our second result strengthens and extends this to a wider class of elliptic curves 
but requires more hypotheses: 

Theorem 1. Let E be an elliptic curve given by the equation y 2 = f{x) where 
f £ Z[x] is a monic cubic with no repeated roots. Suppose further 04(E) < and 
j(E) > e for some e > 0. Let B be any box in the plane with sides of length N . 
Then there is a 5 > (depending only one) such that # (E(Z) H B) = O e (A rl/3 ~ <5 ). 

1.1. Acknowledgments. I am glad to acknowledge the useful conversations and 
insights that many people have shared with me. In particular, I extend my warmest 
regards to Tim Browning, Tim Dokchitser, Roger Heath-Brown, Marc Hindry, 
Pierre Le Boudec, Dan Loughran, Joe Silverman and Trevor Wooley. Many thanks 
go to my advisor Harald Helfgott who gave me this problem and has always been 
generous both as a supervisor and a friend. I would also like to thank the Univer- 
sity of Bristol and the Ecole Normale Superieure for providing me with wonderful 
working conditions during my PhD. Finally, dues are paid to the EPSRC Doc- 
toral Training Award, which has funded my research, with the hope that they may 
continue to support such pure mathematical endevours. 

2. NOTATIONS AND CONVENTIONS 

We shall let E denote an elliptic curve throughout. Our problem is phrased with 
respect to particular embeddings of E and so we shall be assuming that the model 
of E takes a Weierstrass form. We say that E is in long Weierstrass form when we 
have 

E : y 2 + a x xy + a 3 y = x 3 + a 2 x 2 + a 4 x + a 6 

and in short Weierstrass form if we have 

E : y 2 = x 3 + Ax + B 



where ai,A,B £ Z. We make no assumptions on the minimality of E. We also 
associate several standard quantities with E, namely the discriminant A#, 04(E) 
and the j-invariant j(E). In the case of a curve in short Weierstrass form these are: 

A E = -16(4A 3 + 27B 2 ), c 4 (E) = j(E) = -1728^. 



In our bounds we shall make use of both Landau's Big-Oh notation and Vino- 
gradov's <C notation. Given functions /,j:R->Kwc shall write either 
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f(x) = 0{g{x)) or f(x) « g{x) 

in the case that there is a constant C > such that |/(x)| < C \g(x)\. In particular 
/ = 0(1) implies / is bounded. The constant C is suppressed by the notation and 
so is referred to as the implied constant. If we wish to draw note to the dependency 
of the implied constant on something (such as the curve E) then we shall do so in 
subscript writing, for instance, 

fix) = E (g(x)). 

We shall make reference to several different heights in what follows. Let P = 
[x,y) € Z 2 . By the naive height of P we shall mean the maximum of \x\ and \y\. 
When dealing with points on an elliptic curve we shall be studying the canonical 
height h on E. Despite its canonical nature, there are still several ways of scaling 
this height which are used in the literature. We shall be using the scaling which 
corresponds to seeing h as a sum of local heights as defined in [13]. This is half the 
size of the version defined by Tate. 

3. Repulsion techniques 

The majority of the work in this paper goes into putting us into a situation where 
we can use a result about how integral points can be said to repel one another. The 
idea of rational points repelling each other has a wonderful pedigree. In 1969, 
Mumford showed that if C is a curve of genus g > 1 then two rational points 
P, Q € C(Q), which in the Jacobian of C have comparable canonical height, repel 
each other in the sense that the angle between them in the Mordell-Weil lattice of 
Jac(C) is at least arccos(^). This observation, known as Mumford's Gap Principle, 
allowed Mumford to show that the number of rational points on C of naive height 
less than TV is 

Oc (log log N). 
This result was improved by Faltings who showed that 

#C(Q) = O c (l). 

This bound can also be seen as a consequence of repulsion, the key difference being 
that the condition that the points have to be of comparable canonical height is 
shown to not be necessary to force repulsion. So repulsion has a lot to say about 
rational points; how about integral points? 

In the case of an elliptic curve, Mumford's Gap Principle says nothing about ratio- 
nal points other than they repel each other by an angle of at least 0! However, if we 
restrict to integral points then we obtain a non-trivial repulsion of at least 60° be- 
tween points which have comparable canonical height. This phenomenon is present 
in the work of Silverman who showed how to integrate the notion of (S-)integrality 
of points into such repulsion techniques in [14] (later refined with Gross in [7]). If 



INTEGRAL POINTS ON ELLIPTIC CURVES AND THE BOMBIERI-PILA BOUNDS 5 



wc demand that the points reduce to the same point modulo some prime p, we can 
increase the minimum angular repulsion between points at the cost of having to 
partition the points into the fibres of the reduction map. To turn this repulsion into 
a bound on the number of integral points of canonical height less than h, we simply 
slice up the set of integral points into strips with comparable canonical height, i.e., 

(1 - e) log TV < h(P) < (1 + e)logiV 

for some TV > 0, and then to each slice apply bounds for the number of vectors 
which can packed with suitable angular repulsion onto a sphere of dimension r — 1 
where r is the rank of E(Q). The idea of gaining bounds through slicing and 
sphere-packing was first seen in the thesis of Helfgott and in the paper [9] . These 
ideas culminated in the paper [10] of Helfgott and Vcnkatesh who show how this 
repulsion can be obtained in a uniform manner and also how the choice of the prime 
p can be optimised to yield the following bound: 

Theorem 2. [10, Corollary 3.9] Let E be an elliptic curve over Q with integer 
coefficients and e > 0. Then there is a 5 > such that the number of integral points 
of E of canonical height less than h is 

0e ( e Ml-«)+e) 

where the implied constant does not depend on E. 

Thus we see that this theorem yields the desired result when h < 3 log N and 
in fact we can let h be a touch larger. In particular, the key to our application of 
Theorem 2 will be the following result. 

Proposition 1. Let E be an elliptic curve in short Weierstrass form and N, 5 > 
such that \A E \ < N 4 + 65 and \A\ < N 4 / 3 + 26 . Then if Q = {x,y) e E(Z) has 
\x\ < N 2 / 3 + 5 then 

h(Q)< Q + 01ogiV + O(l) 
where the implied constant is absolute. 

4. Proof of Proposition 1 

The idea of the proof is to break up the canonical height, h, as a sum of local 
canonical heights A p , one for each prime p and then bound each part. We shall use 
the local heights as they are defined in [13, Chapter VI]. 

For the finite primes we shall use the following bound, due to Tate. 

Lemma 1. Let p be a finite prime, X p the local height of E at p and let Q € E(Z). 
Then 

\ ir>\ ord p (A E ) 
A P (Q) < — ^ log p. 

Proof. [11, Chapter II, Theorem 4.5] 

□ 
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Thus we see that for Q € E(Z), 

(2) ^A p (Q)<llog|A £ |. 

This leaves just the infinite part to estimate. Let us refresh ourselves on the theory 
of elliptic curves over C. 

Every elliptic curve E over C is isomorphic to C/A for some rank 2 lattice A C C. 
The isomorphism is explicit and comes from the Weierstrass py\-function which 
maps C/A to E(C) and is defined by 

Pa(z) = \ + J2 ( 2k + l)G 2fe+2 (A)z 2fc 

z fe>0 

where 

G »( A )= £ i 

o/weA 

is the weight 2k Eisenstein series associated to A. The isomorphism between C/A 
and E(C) is given by 

(p A {z),p' A (z)/2). 

In this case the equation for E can be recovered from A via 

E:y 2 = x 3 - 30G 4 (A)a; - 70G 6 (A). 

Two curves C/Ai and C/A 2 are isomorphic over C if and only if there is some 
u e C x such that uAi = A 2 . In this way one can normalise the form of A to be 
A T =< 1, t > where 

t € |z £ C : |Re(z)| < ^, \z\ > l| =: T. 
For such a r let E T be the curve associated to A T . 

Another way of characterising two curves being isomorphic is via the j-invariant 

JE = -1728^-. 

Two curves, E\ and i? 2 are isomorphic if and only if je — je 1 ■ Moreover, if E is 
defined over a field K then 2e S E. The curves we are interested in are defined over 
Q and thus over R, so it is sensible to ask which curves in T have real j-invariant. 
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Lemma 2. Let E/M. be an elliptic curve. Then there is a unique r in the set 

C = {ib:b>l}U jt 
such that j(E T ) = j(E). 

Proof. [13, Chapter V, Prop 2.1, p.414] □ 

If j(E) = j{E T ) then we shall say that E is associated to r and if the isomorphism 
between E and E T is defined over a field K we shall say it is associated to t over 
K. Let us label the three sets which make up C by Ci, C 2 and C 3 respectively. These 
regions neatly separate up the elliptic curves defined over BL The curves which are 
associated to r £ C\ are those with two real components. These curves have A < 
and j > 1728. The curves associated to t <G C3 have one real component, A < and 
j < 0. Finally, the curves associated to r £ Ci have one real component, A > and 
< j < 1728. It will be noted that no mention of the B coefficient has been made. 
This is because we can change the sign of the B coefficient via the isomorphism 

(x,y) 1 y (-x,iy) 

It is worth noting that in terms of the underlying lattices, this isomorphism is 

Ah -A 

i 

and so consists of rotating the lattice through 90°. We shall refer to the image of 
E T under this map as its twist. We note that every elliptic curve defined over M. is 
isomorphic over R to either E T or its twist for some r e C. 

We shall now look at the set E(1H) in a bit more detail. Let E(C) = E T (C) and 
consider a fundamental parallelogram for A r 

V T = {xi + x 2 t :x 1 ,x 2 e (-1/2, 1/2]} . 

For every elliptic curve E which has j(E) € M, there is a u e C x such that uA = uA. 
Let A' = uA. Then pA'(z) = pA'(z) and so the real points are those which are 
invariant under the action of complex conjugation on C/A'. 

For curves E T with r E C\ or C3 we already have A T ~ A T . This means that 
in the case of C\ , the (two) real components are the images of the points 

xi + x 2 t e V T ■ x 2 £ jo, 1 



2 

and for C3 the (one) real component is the image of 



{zi + x 2 t £ V T : x 2 = 0} . 
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In the case of C 2 we need to modify A T to A' 



A. This has the effect of rotating 



the point 1 + r onto the real axis. Thus the real component is the image of 



The final thing left to note is that if we take the twist of E T (which flips the sign 
of the B coefficient), call this twist E' T , then since this rotates the lattice by 90° 
it is easy to find the points of V T which correspond to points on E'(R) (after we 
rotate the lattice). They are the points which, after we rotate A' T by 90°, map to 
themselves under complex conjugation. These are the points on E(C) which are 
mapped to E'(R) by the twist. 

Let t € C and let E be associated to r over R. We want to study Aoo(u) for those 
u e V T which are mapped to E(R) and where the naive height of the x-coordinate is 
bounded. We have now identified the points which map to E(R) and later we shall 
find conditions on u which correspond to the bound on the x-coordinate. We also 
wish to study the same points for the twist E' of E. Rather than studying Aoo on 
the fundamental domain for E', it is sufficient for us to understand the behaviour 
of Aoo(u) for those u e V T which correspond to points on E'(R) after we twist. 
This is sufficient since the value of Xoo(u) does not change as we twist E to E' . 
Therefore we shall, in the sequel, study the behaviour of \oo{u) for those u € V T 
which map to either E(R) or to the image of E'(R) in E(C). 

We now have enough background to start examining the archimedean local canon- 
ical height. 

Take a point u = u\ + u 2 t e V Tl then following [13, p. 468] we can define the 
archimedean local canonical height to be 



where q — e 27 " T , B 2 (x) = x 2 — x + 1/6 is the second Bernoulli polynomial and 



In order to analyse the behaviour of for points of E^(R), we note that there 
is a similarity between certain terms in Aoo and the Jacobi product formula for the 
discriminant function when viewed as a function on J 7 , 




Ac» = -\b 2 {u 2 ) log \q\ - log |1 - t\ - log J] (1 



q n t)(l - q^- 1 ) 



n>0 



(3) 



A(t) = A(B t ) = (2i) 12 ? H(1-5") 



n>0 



We can use this similarity to our advantage. 



Lemma 3. Let t e T . Then 



Aoo(«) < - log 1 1 - t| - ^-log|A(r)| + 0(1) 
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where the implied constant in the 0(1) is absolute. 

Proof. First we note that since Aoo(u) = Aco(— u) for all u G V T we can choose u 
such that u 2 G [0, 1/2]. Noting that \q\ < 1 and that B 2 (u 2 ) G [-1/12, 1/6] we have 

-^■B 2 (w 2 ) lo § 1^1 - ~T2 log I 9 ' ' 

This corresponds to the q term in A(t). Next we note that t = e 2mu = e 2mui q u *, 
and so 



n>0 



^ — e 27rittl q n + U2 ) (1 — p _27r '"i q n ~ U2 ) 

> n (i — kr + " 2 xi — kr~" 2 ) 

n>0 

> n^-i^na-kr 172 ) 

^ 7rn )(l - e ~ v ^ 7r (™~ 1 / 2 ') 



n>0 

> H(l-e- 

n>0 

= 0.92984... 



where in the penultimate line we use the fact that Im(r) > Therefore Aoo(ii) < 
— j2 l°g M — log 1 1 — t| + 0(1), which only leaves us to show that 

log|A(r)| =log|g|+0(l) 
and this is the case since the lower bound on Im(r) allows us to deduce that 



21.588 < log 
Thus we have our result. 



(2K) 12 X\{l-q n f 



n>0 



< 22.4554. 



□ 



Here is the strategy for the rest of the proof. Let E be given in short Weicr- 
strass form and suppose that \A E \ < N i+6S and \A\ < N A ^+ 2S . There is a r G C 
such that E and E T are isomorphic over C. We want to understand the behaviour 
of Aqo at points of V T which correspond to points Q = (x(Q),y(Q)) G E(R) and 
satisfy \x(Q)\ < N 2 / 3+s . Since E and E T are in short Weierstrass form we know 
that there is a w 6 C such that 



(4) 



E T (C) ^ E(C) 
(x,y) i ^ (w 2 x,w 3 y) 



and so we have x(Q) = w 2 p T (u) since (p(u), p'(u)) parameterises E T (C). The idea 
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of the rest of the proof is as follows: we can deduce the size of w from the ratio 
of and A(r) since the discriminant is a weight 12 modular form and so the 
ratio is w 12 . Next, we use this to get an upper bound on the size of |p(u)| which 
is necessary for |x(Q)| to be smaller than N 2 ^ 3+s . Finally we can use this upper 
bound to get a lower bound on This bound will turn out to be just what we 
want in order to cancel out the contribution from the local heights at the finite 
primes. 

We shall start by understanding p T a bit better and since 



fe>i 



it makes sense to start with the G 2k - 



Lemma 4. For r e C and k > 2 we have \G 2 k(r)\ < 80. 



Proof. We have 



\G 2k (i 



E 



1 

~?2k 



< 



E ,,,2k - E 



where the last line is justified by the fact that, for r in C, the \ui\ > 1 for every 
weA,. Thus it only remains to bound this last term for all r £ C. 

For t = bi e Ci we have 



\G 2k (r) 



< 



< 



1 



E 

V 

(m> n)ez2-{0} \m + nbi\ q 



(m 2 + (nb) 2 ) 2 
1 

(m 2 + n 2 ) 2 



' 7. 



Similarly, if we consider r = \ + bi with b > \ then we have 
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\G 2k (r)\ < J2 



£ 1 - 

(m,n)eZ2-{0} |™+(§+W)n| 

- E 



(m 2 + ((|+^)n) 2 ) 2 



- E 



i 



< 



E 



(to 2 + (nb) 2 +n 2 /4) 2 
1 



(m 2 + (|) 2 ) 2 



80. 



This just leaves r e C 2 . For this we note that the transformation 



ct ; t i — y 



— 1 

T - 1 



takes C 2 to the line segment 



Since G 2 k is a weight 2fc modular form we have the relation 



\G 2k (r)\ = \r-l\- 2k \G 2k (a(r))\ 
< |G 2fc (a(r))| 
80. 



Thus |G 2 fe(r)| < 80 for all r e C. 
With this estimate we learn the following fact about 
Corollary 1. Let t e C and tie £> r . T/ien 



|pr(«)| > na - 10 °- 
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Proof. First note that from the previous lemma and the fact that |u| < |, since 
u eP T , we have 



]T(2fc + l)G 2fc+2 (T), 



fe>i 



.2k 



< 



< 



< ^(2fc + l)|G 2fe+2 (r)| 
fc>i 

80^(2fc+l)|u| 2fc 
fe>i 

80^ (2fc+ 1)(0.5) 2 



2/c 



fe>l 

= 97.7778 
100 



and so wc have 



\Pt(z)\ > 



^(2fc + l)G 



2 k 



> 



fc>i 



- 100. 



(Of course this bound is trivial unless |tt| < ^00") 



□ 



We shall now restrict our attention to points u £ V T which map to E T (R) or E' T (R) 
(after the twist). Our aim is to use the above bound to get a lower bound on |u| for 
those ueV T which map to points Q on E(R) or £"(R) such that \x(Q)\ < N 2 / 3+s . 



Since the result of Corollary 1 is trivial unless |«| < 
hypothesis giving an upper bound on 



/100 



wc shall need to make a 



Lemma 5. Consider t <G C and let E be the elliptic curve associated to r over R. 
Consider those u e V T which map to points Q = (x(Q),y(Q)) in E(C) such that 
\x(Q)\ < N 2 / 3+s . Then if \u\ < -4^ we have that 



/200 



| U | > JL JV*/12-1/3-V2| A(t) |-1/12 

v2 

where rj > is such that \Ae\ — N v . 

Proof. We shall argue by the contrapositive. Thus we wish to find a value f3(N) > 
such that if |u| < /3(N) then we have \x(Q)\ > N 2 / z+s . Since E and E T arc in short 
Weierstrass form we know that there is a w € C such that 



|w; 12 A(t) 



Thus 



2 1 =N^ 6 \A(t) 



-1/6 



This means that 



INTEGRAL POINTS ON ELLIPTIC CURVES AND THE BOMBIERI-PILA BOUNDS 13 



\x(Q)\ = \w 2 p(u)\ 

= |A(t)|- 1/6 |p(u)| 

and so if wc want \x(Q)\ > N 2 / 3+s then 

(5) N 2 ' 3+s < iV"/ 6 |A(T)r 1/6 |p(u)| 



is sufficient. Now, since |u| < ^= then Corollary f tells us that 

\p(u)\ > i - 100 > 1 



I |2 — | i2 • 

\u\ 2 It 



To ensure that condition (5) is met we have the stronger condition: 



N 2/ 3+ 8 K Ij^/e | A (r)|- 1/6 -X 
2 Id' 



and so we see that \x(Q)\ > N 2 / 3+s if 

| U | < _jL J^/12-1/3-V2| A(t) |-V12 

V2 

□ 

The final step is to find an upper bound on — log |1 — 1\. We start by making a 
small but useful observation. 

Lemma 6. Let x e [0, 1], then 

|l-e fa |>| 

and 

|l-e-|>f. 

Proof. This is easily seen after expanding the power series for e~ x and e lx . □ 

Lemma 7. Le£ u E V T be a point which maps to E(M) or the image of _E'(R) in 
£(C) and suppose u satisfies 1^/12-1/3-5/2 |A(r)f 1/12 < |u| < , £/ien 



\o g \l-t\<[\ + 5 --^\\ogN+^-\og |A(r)| + 0(1) 



V2 1 V ;l ^ 11 ^ V200 

H - log TV H 

3 2 12 y 8 12 

where the 0(1) is absolute. 

Proof. Let t = oz € C\ or | + 6z E C3. Since we are interested in the points of V T 
which map to a point in either E(R) or E'(R) we recall from our previous discussion 
of such points that u is either in the interval [0, 1/2] or [0, bi/2) depending on the 
sign of B in the equation for E. (Note that we are using the assumption that 
\u\< ^= here to ensure that we are on the connected component containing the 
identity element.) If we are on the interval [0, 1/2] then we have 
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- log 1 1 -t\ 



< 
< 



log|l-e 2 
log |7tm| 



log 



7rJV ,/12-l/3-tf/2 A(T j-l/ 



12 



^ + ^ll) 1OgAr+ l^ l0g|A(T)l+0(1) 



where the second line is justified by Lemma 6. Next, let u € [0, bi/2]. Then we 
have 



log j 1 - t| = 

< 

< 



-log 
— log \nu 
-log 
1 



1 _ e -^\u\ 



Jj_ N v/i2-i/3-8/2 A , T yi/i2 
V2 



i + S --IL\\ogN+±\og\A(T)\+0(l) 



where the second line follows from the fact that 2tt \u\ < < 1 and Lemma 6. 

This gives us our result in this case. Only C 2 remains. Let r = e lB then we 
have u on the line 



x(l + cos 8 + i sin 6) : — - < x < - 



1} 



This means that wc have 



M = \ x 

| a/2(1 + cos6»). 



We can assume that x > and so we have 



log 1 1 - t| 



log 



1 - e 



27ria;(l+cos 8+i sin 8) 



< — log 1 — e 



-2-kx sin 9 I 



log 



— 2tt|ti| sin g 



Since t e C we have | < 9 < | and thus ^ < sm an( } CO s# > 0. Hence 
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- log 1 1 - t| < 

< 

< 
< 



log 
log 

log 

1 



1 - e" 
V3ttu 



7v r,/12-l/3-5/2 A ( T )-l/12 



3 + ^-^logiV+llog|A(r)|+0(l) 



and so we have shown the lemma to be true. 



□ 



Corollary 2. Let E be an elliptic curve over Q given by E : y 2 = x 3 + Ax + B. 

Let N > and Q = {x{Q),y(Q)) e E(Z) with \x(Q)\ < N 2 / 3+s . Suppose further 

that E is associated with t and Q corresponds to u G V T with \u\ < ' Then 

the statement of Proposition 1 is true in this case, i.e., 

hq)< Q+Qiogiv+oa) 

where the implied constant is absolute. 

Proof. Let tj > be such that \A E \ = N v . Then 

HQ) = ]Ta p (q) 

p<oo 

< iloglAsl+AooM 

< ^logJV-Iog|l-t|-^log|A(r)|+0(l) 

where the second line follows from the inequality (2) derived from Lemma 1 and 
last line is justified by Lemma 3. Now, since |a;(Q)| < N 2 / 3+s we see that Lemma 5 
and Lemma 7 can be combined to obtain 

HQ) < ^log7V+Q + ^-^logiV+^log|A(r)|-^log|A(r)|+0(l) 

= Q + £)logiV + 0(l) 
as required. 

□ 

We now have to deal with the remaining case where u € V T has ^L__ < |m| which 
includes the cases of points on the components of E(R) not containing the identity 
(if they exist). 

Lemma 8. Let E be a curve given by E : y 2 ~ x 3 + Ax + B. Let |As| = N v and 
suppose that \A\ < N 4 / 3+2S for some S > 0. Suppose further that E is associated 
with t £ C , then for N larger than an absolute constant, 
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Im{r) < max { — (4 + 66 - -n) log N + 0(1), D 

[ ZTT 

where D > and 0(1) are absolute. 

Proof. First suppose that r e C^. Then Im(r) < 1 so this region is taken care of so 
long as we have D > 1. Now assume that t e Ci U C3. We have 

- 1728 < CiV 4 ^ 5 -" 

I I 

where C = 1728 x 64. Since j(^) = 3{E T ) we can use the (/-expansion of j(r) 

^ n>0 

where c(n) € Z >0 . By a result of Petersson we have 

c{n) 



as n — > 00. Therefore there is some absolute D > such that if Im(r) = b > D 
then we have 

Thus we have 

e 27Tb <2\j(E)\ < {2C)N 4+es - , >. 

Allowing TV to be suitably large we arrive at our desired bound 
, /4 + 6<J-rA, A7 log2C /4 + 65-jA, Ar 



□ 



This lemma puts us in a position to deal with the remaining points. 



Lemma 9. Let E be given by E : y 2 = x 3 + Ax + B and suppose that \ A\ < N 4 ^ 3+2S 
and \ Ae\ = N 71 for some 77 > 0. If E is associated with r and n£P T is such that 



j= < \u\ then either 



A x (»< (I + ^-^)l„, V + Ofl) 



where the 0(1) is absolute or \qo(u) is bounded absolutely. 
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Proof. We shall work by using the bound from Lemma 3: 

1 



Aoo(w) < — log 1 1 -t\ 



12 



log|A(r)|+0(l). 



First of all we wish to bound the term — log |1 — t\ absolutely. Let u = a + hi so 
that we have 



■loell-tl 



log 



1 



2iri(a+bi) 



Our aim is to show that R — e 27Tl ( a + bl ) j s suitably far away from 1. Since |u| = 
|a + &«| > 'T/^oo we nave that one of |a|,|b| > ^j. Suppose that \a\ > ^, then 



since the argument of the complex number R is 2na and \a\ € [gg, ^] we see that 
\R- 1| > 2 sin (^) > 0.3. Thus in this case 



log 



1 



a 27ri(a-\-bi) 



< -log(0.3) < 1.21. 



Suppose now that |6| > ^. Then \R\ < e^' 10 and so \R - 1| > 1 - e^/ 10 > 0.27. 
Therefore we have 



-log 



I _ e 2iri(a+bi) 



< log(0.27) < 1.31. 



Therefore we see that — log |1 — t\ < 1.31. Next, suppose r e C\ U C3 and Im(r) 
is greater than the D in Lemma 8. Since Im(r) > ^ we see from the proof of 
Lemma 3 that 

|A(r)| = |g|+0(l) 
where the 0(1) is absolute. So we have 
1 



12 



log|A(r) 



1 

~12 
1 

_ 12 
7rIm(T 



Iog|g|+0(l) 
loge- 27rIm M +0(1) 
+ 0(1) 



< 



6 

4 + 65-7] 
12 



log7V + 0(l) 



1 5 T] 
3 + 2 ~ 12 



log7V + 0(l) 



where the penultimate inequality is from Lemma 8. This is the result we require. 
If Im(r) is not larger than D then it is uniformly bounded and thus so is |A(r)| 
which means that we have 



Aoo(u) = 0(1). 
This completes the proof of the lemma. 



□ 
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We are now in a position to prove Proposition 1. 

Proof of Proposition 1: Recall that we are trying to show that 

HQ)< Q + £)iogiv + o(i) 

where the O(l) is absolute. We have the local height decomposition 

HQ) = E vq)- 

p<oo 

Lemma 1 tells us that 

E VQ)<^l°g|A S |- 

For the local height at the archemedian place, Lemma 9 and Corollary 2 both imply 
that we have either 

A 00 (Q)< Q + ^logAT- ^log|A s | + 0(l) 



or we have 



X^iQ) = 0(1) 



where the 0(1) in both cases is absolute. In the first instance we can sum the local 
heights together to get the bound on h(Q) we desire. In the second instance we 
need to invoke the hypothesis that |A#| < N 4+6S to attain the desired result. □ 

5. The Large Sieve 

The Large Sieve deals with problems of the following kind. Let £ C [-N, N]C\Z, 
V be a set of primes and S p be image of S under the map Z — > Z/pZ. Suppose 
that there is an < a < 1 such that \S P \ < ap, for every p e V . Then what can 
we say about |5|? 



Lemma 10. (The Large Sieve) Let S C {M, ...,M + N} for some M e Z and 
N > 0. Take X > and let V be a set of primes each no greater than X. Suppose 
that there is an a > such that \S P \ < ap for each p e V . Then 

a(N + X 2 ) 

151 " (1 - a) \V\ ■ 

Proof. This follows from, e.g., [5, Chapter VIII, Corollary 8.2.1, p.139]. □ 
If we choose N to be large enough so that if 

w(x) — # {p < x : p is prime} 
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then 

AT 1 / 2 
1/2N iV 



v{N 1 / 2 ) > 



21ogAT 1 /2 



then by taking V in Lemma 10 to consist of all primes between 43 and TV 1 / 2 we see 
that 

and so Lemma 10 tells us that 

(6) |S| < 

The next lemma makes these endeavors relevant to our study of integral points on 
elliptic curves. 

Lemma 11. Let E be an elliptic curve in Weierstrass form, M <G Z and let S be 
the set 

S = {x(Q) € Z : 3 Q = (x(Q),y(Q)) G E(Z)} n [M, M + N] 

for some N > 0. Let v > be such that |A E | = iV T '. T/ien |5| = O v (N 1 / 2 log AT) . 

Proof. We are going to show this via the large sieve and in particular the inequality 
(??). Let p be a prime such that p does not divide A^. Then E is minimal over 
Q p and the reduction of E modulo p is an elliptic curve E p . By the Hasse Bounds 
(Theorem ??) we have that 

E p {¥ p )\ <p+l + 2^. 

If p /| 6 then we can put i^p into short Weierstrass form. With at most three 
exceptions, the points of E p (¥ p ) pair up as a {Q, —Q} with both points sharing the 
same x-coordinatc. Thus 

p+l + VP 3 
IM< 2 +2 

and this is less than |p of p > 42. Let V be the set of primes less than N 1 / 2 exclud- 
ing those p less than 42 and those dividing Ap. Since there are O ( x^tygWn ) = 
0,,(log AT) of these we see that we can apply inequality (??) with a = | to attain 

(7) \S\ = O v (N 1 ' 2 logN). 

□ 
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The focus in this investigation is on uniformity and so we would like to remove 
the dependence on the size of A E when dealing with points in E(Z, N). We can do 
this by appealing to a result due to Heath-Brown. 

Lemma 12. Let F(x) £ Z[x] be an irreducible form in three variables of degree d 
with coprime coefficients. Let \\F\\ denote the maximum modulus of the coefficients 
ofF. Let C : F = 0. Then either C(Q, N) < d 2 or \\F\\ < N d(d+i){d+2)/2 _ 

Proof. [8, Theorem 4] □ 
This result holds a fortiori for integral points on elliptic curves and shows that 
either E(Z, N) is bounded by 9 or ||i?|| <C N 30 . In the former case we have a bound 
far better than the Bombieri-Pila bound and so are happy. In the latter case we 
see that since ||F|| < iV 30 we must have \A E \ < \\E\\ 6 < N lm . Thus we have 
an absolute bound on r\ which means that for our set up we can make the implied 
constant in Lemma 11 absolute. 

Proposition 2. Let E be an elliptic curve given by y 2 = x 3 +Ax+B with A,P> e Z. 
Let < M < N and e > 0. Suppose that either M > N 2 / 3+2e or \A\ > 7V 4 / 3+4£ . 
Then 

# (E{Z) n (([-N, —M] U [M, N]) x [-N, N})) = 0(N 1/3 - t/2 ). 

Proof. We can use Lemma 11 to attain such a bound if we were to know that 
the portion of E(R) which lies over the intervals [-N, —M] U [M, N] in fact lies 
over a small absolute number of intervals each of length less than N 2 / 3 ~ e . This 
phenomenon occurs for the portions of E(R) which have gradient of absolute value 
exceeding iV 1 ' 3+e . Thus we should investigate the gradient of E. We have 

dy _3x 2 + A 
dx" ~ 2y ' 

We are interested in the absolute value of the gradient being larger than N 1 ^ 3+e 
which occurs when we have 

\3x 2 + A\ > 2 \y\ N 1 ^ 3+e . 
Since \y\ < N see that this inequality holds if we have 
(8) \3x 2 + A\> N 4/3+e . 

This is the inequality which we will study. If A > then (8) holds if cither 
A > N 4 / 3+e or \x\ > N 2 / 3+e / 2 . The hypotheses of the proposition clearly im- 
ply that we are in one of these cases and so we are done for the case A > 0. 

Now suppose that A < 0. Let us begin by rewriting the equation for E as 



E : y 2 = x 3 - Cx + B 
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where C = \A\ so that the sign of the symbols is clearer. Suppose that C = \A\ < 
N 4 / 3+4e . Then M > N 2 / 3+2e whence 

3x 2 -C> 3iV 4 / 3+4e - C> 2N 4 / 3+4c . 



In this case we see that (8) holds and so we are done. Thus we can assume that 
C > N 4 / 3+4e . Suppose that (8) fails. This would occur if 3x 2 were too close to C 

so we shall write x = ±J ^ + t. Then we have 



2N i/3+e > |3x 2 -C| 



3 ± 



+ t 



C 



(9) 



6t ± 



Let R = ±y^. If Rt > then (9) implies that 

fc 



u\lj\t\ < 2iV 4 / 3+e . 



Recalling that C > N 4 / 3+4e , we have 

2 v / 3iV 4 /^ 



t < 



6VC 



< 



N 2 > 3 - 



This means that x lies in a interval of length at most 27V 2//3_£ which allows us to 
apply Lemma 11 to attain that there are no more than O (TV 1 / 3-6 / 2 ) such integer 
points. Let us now suppose that Rt < 0. We split into two cases: x > and x < 0. 

In the first case we can write x = J ^ + 1 where — J ^ < t < 0. Hence 



C 

"\ljt + M 2 



and putting this into (9) above yields 



2N 4 / 3+f - 2 2/ , 
t < < -^N 2/3 ~ 



As before, this fact combined with Lemma 11 tells us that there are at most 
O (jV 1 / 3 " 6 / 2 ) such integer points. This just leaves the case of x < 0. In this 

case we have x — — + t where < t < \J~^- Thus 



-6a/— i + 3^ 



>3i/£|*| 
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and so, as above, we have 

\t\ < -^N 2 ^. 

Hence we see that in this case as well there are no more than O (N 1 / 3 "^ 2 ) such 
integer points. We are now done since the portion of E(M) in the range given in 
the statement of the proposition can be split up into 3 connected arcs where the 
gradient is greater than iV 1 / 3 ^ (3 arcs since the degree of E is 3), these each lie 
over a region on the x-axis of length no more than 27V 2 / 3_e and the remaining part 
of the curve can be split into 3 regions corresponding to the cases which we dealt 
with above (i.e, the sign of A and the sign of Rt). Each of these regions lies above 
an interval of length less than ~^N 2 / 3 ~ e . Applying Lemma 11 to each of these 
regions and summing them together gives us the desired result. □ 

6. Proof of Main Theorem 

In this section we shall prove the Main Theorem first by showing it holds for 
curves in short Weierstrass form and then by reducing the more general case to 
this form. First of all, let 8 be the same as in Theorem 2. If we look at points 
Q e E(Z)n[-N, N} 2 such that |x(Q)| > N 2 ' 3+2S / k for some k > 2 then Proposition 
2 taken with e = 5/k tells us that there are no more than 

O (Vl/3-<V2fc^j 

of these. Furthermore, if \A\ > ]\]~ 4 /3+ 4S / k then we can again apply Proposition 2 
with the same e to get the same bound. 

Now, suppose that \A\ < N A / 3+iS / k and we restrict to points Q e E(Z) n [-N, N] 2 
with \x(Q)\ < N 2 / 3+2S l k then we see that 

\B\<\y 2 \ + \x 3 \ + \Ax\<N 2+<iS / k 

and so |A B | < N i+6( - 25 / k \ This allows us to apply Proposition 1 with the 5 of that 
Proposition taken to be 28 /k which in combination with the bounds of Helfgott 
and Venkatesh (Theorem 2) tell us that the number of integer points in this case 
is bounded by 

Q^ N (l/3+S/k)(l-8)+e^ = Q^ N l/3-5(k-3)/3k-5 2 /ky 

Since we have k > 2 we see that the above exponent is less than | . We can therefore 
take the maximum of | — ^ and | — <$^r — as the exponent for a bound on 
E(Z,N). Since we are not concerned with optimisation here we shall be content 
with noting that if we ignore the 8 2 /k term (which only acts in our favour) then 
the best value for k is | which gives us that E(Z,N) -C 0(N 1 ^ 3 ^ S ^ 9 ). In any case 
we have the following 
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Proposition 3. Let E be an elliptic curve given by the equation y 2 = x 3 + Ax + B 
where A,BeZ. Then there is a 5 > such that 

#E{Z,N) = 0(N 1/3 - s ). 

In order to prove the Main Theorem we shall need a result drawing on the work 
of Ellenberg and Venkatesh in [6] 

Lemma 13. Let f e Z[X, Y] be a homogeneous polynomial of degree d, let Z C P 2 
be the curve cut out by f. Then the number of rational points on Z of naive height 
no more than N is bounded by 

{N 2/d \\f\\~ 1/d2 + 1)N £ 

where \\f\\ denotes the £ 2 norm. 

Proof. This is the n = 1 case of [6, Proposition 2.1] □ 

Lemma 14. Let E, f be as in the statement of the Main Theorem so E : y 2 = 
x 3 + Cx 2 + Dx + F = f(x). Then if \C\ > N 1+6e we have that 

#E(Z,N) = O (V 1/3 ~ e ) . 

Proof. We shall use Lemma 13 in a fairly crude way. Let g{x,y) = y 2 — f(x). To 
g we associate the homogeneous form 

G(X, Y, Z) = Y 2 Z - X 3 - CN 2 X 2 Z - DNXZ - FZ 3 . 

(This homogenisation is found in [6, Remark 2.2]). Clearly the Z points of E in our 
box correspond to points on G(X, Y, N) — of height no greater than N. Hence 
we can use the bounds of Lemma 13 to get the bound 

(10) #E(Z, N) < {N 2 ' 3 ||G|r 1/9 + l)N e . 

Now, if \C\ > N 1+6e we see that ||G|| > N 3+6e and so (10) becomes 
#E(Z,N) < (N 2 ^ 3 N-^ 3 - 2e + l)N e = O (at 1 / 3 " 6 ) 

as required. □ 

Proof of Main Theorem. We begin by changing the coordinates in order to bring 
the curve first into a short Weierstrass form. We move from coordinates (x, y) to 
(X, Y) where 

X = 9x + 3CY = 27y. 



This gives us a new expression for E 
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E' : Y 2 = X 3 + Ax + B 

with A, B € Z. Clearly integral points of E are sent to integral points on E' . 
Moreover, points in #_E(Z,iV) arc sent to points of E in the box 

B = [-9JV + 3C*, 9iV + 3C] x [-27N, 27N}. 

Now, Lemma 14 allows us to assume that \C\ < N 1+6e for some e for otherwise we 
have the bound 

O (iV 1 / 3 ^) 

and so could take 6' — e. Thus we have that 

B C [-47V 1+6e ,4iV 1+6e ] 2 . 

By Proposition 3 we know that there is a S > such that the number of integral 
points in this box is 

O (^ N a+ee)/3-s^ 

and so by choosing e small enough we see that there is a 5' > such that 

#E(Z,N) = O . □ 

7. Extending the result 

So far we have been considering the number of integer points on an elliptic curve 
which fall inside a square box centred at the origin. Our Main Theorem applies 
in the case that the curve is of the shape y 2 = f(x). In this section we shall con- 
sider the problem of placing the box arbitrarily in the plane for curves of this shape. 

As before, we shall start by considering curves in short Weierstrass form. Let E be 
an elliptic curve in short Weierstrass form, N > and Q = (x(Q),y(Q)) <G E(Z). 
Let a, -q > be such that |x(Q)| = N a and |y(Q)| = N n . The main observation we 
shall exploit is contained in the following 

Lemma 15. Let E be an elliptic curve in short Weierstrass form with \ j(E)\ > e\ 
and \A\ < 7V 4? '/ 3 ~ £ ° for some e ,ei > 0. Let Q,a,r] be defined as above. Then for 
any e > and N > sufficiently large with respect to e, eo> £i we have 

3 " 2 

Proof. Since E has the form 

y 2 = x 3 + Ax + B 



we easily see that 
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N 2 V < N 3a + N a + < + + _ 

Suppose for a moment that we can assume that \B\ < DN 2r, ~ K for some D, k > 0. 
Then for N sufficiently large we would have 7V~ K < ^ and so 

N 2v < 2 ^iV 3a + 7V a+4r '/ 3 ~ £o ^ . 

It is not hard to see that the lemma would follow from such an inequality so long as 
k and D are bounded in terms of e, eo, t\. Thus we just need to control \B\. Since 
\j(E)\ > ei we have that 

(II) 1728(4 |A| 3 ) > ei |4A 3 + 27B 2 | . 

If A > then (11) yields 



\b\ < 2 y 172 2 8 7e 61 ^ 3/2 < d c1 n^-^ 2 

which is as we wanted. Now suppose that A < 0. Set C = Then from (11) we 
have 

1728(4C 3 ) > ei |27B 2 - 4C 3 | . 
Suppose that 27 B 2 > AC 3 . Then we have 



\B\ < 2 / 1728 + 61 C 3/2 < £) jy2 v -3e /2 

1 1 " V 27ei - £1 

as required. If, on the other hand, 27£? 2 < 4C 3 then 

^ 2^3/2 ^ 2 N2 _ 3eo/2 

which is as we want. □ 
We shall now use this lemma to prove Theorem 1. 



Theorem 1. Let E be an elliptic curve given by the equation y 2 = f(x) where 
f £ Z[x] is a monic cubic with no repeated roots. Suppose further 04(E) < and 
j(E) > e for some e > 0. Let B be any box in the plane with sides of length N . 
Then there is a 8 > (depending only one) such that # (E(Z) n B) = 0(7V 1 / 3 ~ <5 ). 

Proof. We shall begin by assuming that E is in short Weierstrass form and then 
derive the more general statement from this. So, let E be cut out by the equation 



E : y 2 = x 3 + Ax + B 
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and let B be a box of size N by N somewhere in the plane. The aim is to get a 
bound on #E(Z) n B of the quality 0(N 1/3 ~ e ) for some e > 0. We shall be using 
the assumption that A > (which is the same as assuming 04(E) < 0). We would 
like to consider where gradient considerations and the Large Sieve can give us such 
a result via Lemma 11. To do this we need to be able to control the size of A#. 
Suppose that B is centred at the integer point (#0,2/0) • Then by the change of 
variables 



X 



£0 Y = y - yo 



we can move B to be centred at the origin. This has the effect of changing our 
equation to 

E' : Y 2 + 2Yy + y 2 = X 3 + 3X 2 x 3Xx 2 + xf, + AX + Ax + B. 

It is clear then that ||£"|| > max{|y | , |^o} and so by Lemma 13 we see that if 
either \xq\ or \y \ are greater than N 3+1Se then E'(Z,N) <g; N 1 / 3 ~ e which implies 
the result of this Theorem. Hence we may assume that both are bounded by 
7V 3+18c . In this case we have that the points E(Z) (~l B (note that this is E not 
E') are counted by E(Z, 2N 3+18e ). Applying Heath-Brown's result (Lemma 12) we 
see that we have \\E\\ < jv 30 ( 3 + 18£ ) which in turn means that |A S | < 7v 18 °( 3+18e '. 
This bound for the size of is good enough for our needs since the exponent of 
N is absolute. We can now proceed with the use of the Large Sieve technique. We 
have 



3x 2 + A 

Since our box is of size TV by TV we see by the same reasoning as before that the 



dy 




3x 2 + A 


> 


dx 




2y 





Large Sieve delivers the bound we want if we have 
assume that this is not the case. Thus we have 



> tv 1 / 34 " 26 hence we shall 



Since A > we learn that 



and 



\3x 2 + A\ < 2N 1/3+71+t . 



< jvl/6+r,/2+£/2 



\A\ < 2N 1/3+ri+f -. 



We wish to apply Lemma 15 and so need to check that the hypothesis on A is 
satisfied. This would be the case if 



1 4r/ 
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for some e > 0- This is true if and only if 

j] > l + 3(e + e )- 

If this inequality were false then by a change of coordinates we can move the box 
horizontally so that it lies over the interval [-N, N]. This would have the effect of 
putting the curve into the form y 2 = f(x) and so we could apply the Main Theorem 
with a box of size N 1+3 ^ e+e °\ So long as e, eo were small enough (with respect to 
the S in the Main Theorem) we could get the type of bound we want. Thus we see 
that we may assume the hypothesis on A required for Lemma 15 safely. We thus 
apply the lemma with a = 1+3 g+ 3£ to attain 

1 < — + 1 + — 
3 ~ 12 4 4 

and so 

r, < 1 + 15e 

which, by the same argument as above, can be dealt with by the Main Theorem 
for e sufficiently small. This concludes the proof of the Theorem for the case where 
E is in short Weierstrass form. 

Suppose now that E is given by y 2 — f(x) where / is as in the statement of 
the Theorem. Let B be a box of size N by N anywhere in the plane. By an obvious 
change of coordinates we can move B so that it is centred at the origin. This has 
the effect of changing the equation for E into something of the form 

y 2 + ay = g(x) 

where a € Z and g € Z[x] is a monic cubic. By the same application of the results 
of Ellenberg and Venkatesh (Lemma 13) as we used in the proof of the Main The- 
orem we can replace g with a cubic of the form x 3 + Ax + B and at the expense 
of increasing the width of B to 47V 1+6e for some e > which we can make as small 
as we like. By applying one last change of variables we can eliminate the ay term 
which results in the box being moved vertically and replaces our equation with one 
in short Weierstrass form. This has been dealt with already and so we are done. □ 

We shall end this section by making a few remarks about why Theorem 1 has 
the hypotheses it does and why we cannot remove them at present. Firstly, why 
the shape of the equation is y 2 = f(x) rather than the more general Weierstrass 
form of y 2 + a\xy + a 3 y = f(x). It is clear that we can deal with equations of the 
form y 2 + a 3 y = f(x) with Theorem 1 (so long as the other hypotheses are met) as 
was seen above in the proof of Theorem 1. The reason we cannot deduce results on 
the long form from the form in Theorem 1 is that the change of variables 



Y = y+ —x , X = x 
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which carries the equation y 2 + a\xy + a 3 y = f(x) to one of the form Y 2 + a 3 Y — 
F(X) has the effect of changing the box [-N, N] 2 (in which we are performing our 
count) to a parallelogram which has width along the X-axis of 2N but height on 
the F-axis of 0(a\N). We can get a control on |ai| from the bounds of Ellenberg 
and Venkatesh as we did in Lemma 13 yielding that we have an improvement on 
the Bombieri-Pila bounds if |oi| > N 1+e for some e > 0. However, this control is 
not strong enough to get a result in the case |ai| < N 1+e by change of variables. 
This is because the parallelogram mentioned above could have vertical height up 
to 0(N 2+e ) which is too large for our methods to apply. 

As for the hypotheses \j(E)\ > e and c A (E) < 0, these can both be understood 
in terms of our application of the large sieve. The idea is that the large sieve helps 
when the gradient of E(R) is particularly steep. Let us consider the case where 
j(E) — 0. These are the Mordell curves Ed ■ y 2 — x 3 + D. If we consider the cases 
where D > and indeed is substatially larger than N then when we look at the 
portion of Ed(M.) which lies above the interval [-N, N] on the x-axis, we see that 
it has a rather flat slope. It turns out that for many values of D (large in terms of 
N) this slope is too shallow for us to apply the large sieve and so we cannot get our 
result. The same problem arises for curves with 04(E) < 0. It may be that there 
is a more subtle way of using the sieve which can help here but it is my opinion 
that a new method is needed to deal with both these hypotheses and the case of 
the general Weierstrass form. 

8. An application to degree 1 del Pezzo surfaces 

A del Pezzo surface of degree d defined over Q is a smooth, projective algebraic 
surface which has ample anticanonical bundle —K x and where the self-intersection 
of Kx with itself has intersection number K x = d. For these surfaces we may take 
the height Hx associated with the anticanonical embedding and study the counting 
function 

X(Q, N) = # {P G X(Q) : H X (P) < N} . 

The main conjecture in this area is Manin's Conjecture [2] which in the case of del 
Pezzo surfaces claims that there is a Zariski open subset U C X so that one has 

U(Q,N) ~ cxN(logN) Px ~ 1 

where cx is a constant depending on X and px is the Picard rank of X. There 
is an industry in proving Manin's Conjecture for specific del Pezzo surfaces, often 
using delicate arguments from analytic number theory (for an overview see [4]) and 
from experience there it is understood that the problem becomes more difficult 
the lower the degree of the del Pezzo surface. Evidence for this comes from the 
fact that Manin's Conjecture has not been verified for a single degree 1 del Pezzo 
surface. In terms of upper bounds for X(Q,N), little is known in general. That 
the results proved thus far in this paper can serve to do this was pointed out to me 
independently by Daniel Loughran and Pierre Le Boudec. 
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Proposition 4. Let X be a degree 1 del Pezzo suface defined overQ and let N > 0. 
Then there is a S > such that X(Q, N) <C N 3 ~ s where the implied constant is 
uniform in X . 

Proof. It is known (eg, [1, § 2.2.3]) that X can be embedded into weighted projective 
space Pq(3, 2, 1, 1) as a smooth sextic of the shape 

(12) S :y 2 = x 3 + F 4 (u,v)x + F 6 {u,v) 

where y,x,u,v have weights 3,2,1,1 respectively and where Fj are homogeneous 
polynomials of degree i. In this setting, the counting function X(Q, N) is bounded 
by the size of the set 

S(N) = {P = (y, x, u, v) e S(Q) : x, y, u, v € Z, |z| < iV 2 , \y\ < N 3 , \u\ < N, \v\ < N} . 

For each choice of a pair (u,v) € Z 2 with \u\ , \v\ < N (of which there are clearly 
O (N 2 )) we have that (12) specialises to the equation for a cubic curve in short 
Weierstrass form with integer coefficients which we shall denote This curve 

will be an elliptic curve so long as we have 

A(E {U , V) ) = -16(4F 4 (u, vf + 27F 6 (u, vf) ? 0. 

The pairs for which A(Er UtV \) = are integer points lying on a degree 12 curve 
in Aq and so by the Bombicri-Pila bounds the number of these points for which 
(u, v) <E [-N, N] 2 is bounded by O e (iV 1 / 12+e ). Let (u, v) be such a pair and consider 
then the curve Ei u>v y All of the points (x, y) <G Ef u<v \(Z) for which (y, x, u, v) lie in 
S(N) are within the box [— iV 3 ,iV 3 ] 2 and so applying Bombicri-Pila again we see 
that E {u ^ v) (Z,N 3 ) < e N 1+t . Thus these curves contribute at most O e (jV 13 / 12+£ ) 
points to S(N). Since we are interested in a bound of the quality O (iV 2_<5 ) then 
we see that these points are insignificant to the count so long as 6 < 11/12 — e 
(which it will be). 

Now, let (u, v) be a pair such that Er UjV ) is an elliptic curve. By the Main Theorem 
we see that E^ u iV 3 ) <C N 1 ^ 5 for some 5 > 0. Taking into account the different 
choices for the pair (u, v) wc attain the desired bound on X(Q, N). Of course this 
5 is thrice that of the 5 in the Main Theorem. □ 

In certain special cases then there are some results due to Munshi in [12] who 
shows that for del Pezzo surfaces of the form 

Si : y 2 = x 3 + F 4 (u, v)x 
S 2 : y 2 =x 3 + b(F 2 (u,v)) 3 
S 3 : y 2 =x 3 + b(F 3 (u,v)) 2 
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where b € Z then one can obtain Si(Q,N) <Cb N 2+e where the implied constant 
depends on b for the last two examples but not on the Fj in any case. These follow 
from the fact that we have a non-trivial torsion section of these elliptic surfaces over 
either Q in the first case, Q(\/&) in the second and over Q(Vb) in the third. Such 
an approach will not work in general since we cannot hope for a torsion section of 
X to exist over a fixed number field K/Q in general. We also mention that through 
a different approach Munshi also shows that for 

X:y 2 =x 3 + (F 2 (u,v)f 

one can obtain X(Q,N) -C 7V 4 / 3+£ . We note that whilst our results are not as 
strong, they are hold for any such surface and are completely uniform in terms of 
the surface X. 



9. Concluding Remarks 

It would be desirable to remove the hypotheses in Theorem 1 and to extend it 
curves in long Weierstrass form however the methods employed in this paper do not 
seem to be able to handle these extensions. We also note that our methods make 
use of a lot of information regarding elliptic curves, most importantly the explicit 
equations for the local canonical heights. This means that the generalisation of 
the methods seen here to higher genus curves would not be trivial. It has been 
suggested that the correct setting for such arguments would be within Arakelov 
Theory. It would be interesting to see if such a reformulation of these ideas in that 
domain could lead to generalisations of the results seen here both to number fields 
and to higher genus curves. 
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